Feature Words Selection for Knowledge-based Word Sense Disambiguation with Syntactic Parsing
نویسندگان
چکیده
Feature words are crucial clues for word sense disambiguation. There are two methods to select feature words: window-based and dependency-based methods. Both of them have some shortcomings, such as irrelevant noise words or paucity of feature words. In order to solve the problems of the existing methods, this paper proposes two methods to select feature words with syntactic parsing, which are based on phrase structure parsing tree (PTree) and dependency parsing tree (DTree). With the help of syntactic parsing, the proposed methods can select feature words more accurately, which can alleviate the effect of noise words of window-based method and can avoid the paucity of feature words of dependency-based method. Evaluation is performed on a knowledge-based WSD system with a publicly available lexical sample dataset. The results show that both of the proposed methods are superior to window-based and dependency-based methods, and the method based on PTree is better than the method based on DTree. Both of them are preferred strategies to select feature words to disambiguate ambiguous words. Streszczenie. W artykule zaproponowano dwie metody selekcji cech słowa bazujące na analizie składni struktury frazy oraz analizie składni zależności. Badania przeprowadzono przy wykorzystaniu rożnych baz danych. Proponowana metoda ma większą dokładność niż dotychczas stosowane metody: okna i zależności. (Selekcja cech słowa dla jednoznacznego wykrywania znaczenia z syntaktyczną analizą składni)
منابع مشابه
Utilizing Clues in Syntactic Relationship for Automatic Target Word Sense Disambiguation
Multiple translations to the target language are due to several meanings of source words and various target word equivalents, depending on the context of the source word. Thus, an automated approach is presented for resolving target-word selection, based on “word-to-sense” and “sense-to-word” relationship between source words and their translations, using syntactic relationships (subject-verb, ...
متن کاملAutomatic Target Word Disambiguation Using Syntactic Relationships
Multiple target translations are due to several meanings of source words, and various target word equivalents depending on the context of the source word. Thus, an automated approach is presented for resolving target-word selection, based on “word-to-sense” and “sense-to-word” source-translation relationships, using syntactic relationships (subject-verb, verb-object, adjectivenoun). Translation...
متن کاملA model of syntactic disambiguation based on lexicalized grammars
This paper presents a new approach to syntactic disambiguation based on lexicalized grammars. While existing disambiguation models decompose the probability of parsing results into that of primitive dependencies of two words, our model selects the most probable parsing result from a set of candidates allowed by a lexicalized grammar. Since parsing results given by the lexicalized grammar cannot...
متن کاملSRCB-WSD: Supervised Chinese Word Sense Disambiguation with Key Features
This article describes the implementation of Word Sense Disambiguation system that participated in the SemEval-2007 multilingual Chinese-English lexical sample task. We adopted a supervised learning approach with Maximum Entropy classifier. The features used were neighboring words and their part-of-speech, as well as single words in the context, and other syntactic features based on shallow par...
متن کاملNamed Entity Disambiguation using Freebase and Syntactic Parsing
Named Entity Disambiguation (NED) is a fundamental task of semantic annotation for the Semantic Web. The task of Word Sense Disambiguation (WSD) in Ontology-Based Information Extraction (OBIE) aims to establish a link between the textual entity mention and the corresponding class in the ontology. In this paper, we propose a NED process integrated in a rule-based OBIE system for French. We show ...
متن کامل